Selectivity Estimation of Inequality Joins in Databases
Authors
Abstract
Selectivity estimation refers to the ability of the SQL query optimizer to estimate the size of the results of a predicate in a query. It is the main calculation based on which the optimizer can select the least expensive plan to execute. While the problem has been known since the mid-1970s, we were surprised that there are no solutions in the literature for the selectivity estimation of inequality joins. By testing four common database systems (Oracle, SQL-Server, PostgreSQL, and MySQL), we found that the open-source systems PostgreSQL and MySQL lack this estimation. Oracle and SQL-Server make fairly accurate estimations, yet their algorithms are secret. This paper, thus, proposes an algorithm for inequality join selectivity estimation. The proposed algorithm was implemented and sent as a patch to be included in the next releases. We compared this implementation with the above DBMS for three different data distributions (uniform, normal, and Zipfian) and showed that our algorithm provides extremely accurate estimations (below 0.1% average error), outperforming the other systems by an order of magnitude.
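To make the estimated quantity concrete, the following minimal sketch computes the exact selectivity of an inequality join R.a < S.b, that is, the fraction of all (r, s) pairs that satisfy the predicate. It is not the algorithm proposed in the paper, which the abstract does not describe; it only shows the ground-truth value that an optimizer's estimate would be compared against. The function name and the sample values are hypothetical.

```python
from bisect import bisect_left


def inequality_join_selectivity(r_values, s_values):
    """Exact fraction of pairs (r, s) with r < s.

    Illustrative only: an optimizer would approximate this value from
    column statistics instead of scanning both inputs.
    """
    if not r_values or not s_values:
        return 0.0
    sorted_r = sorted(r_values)
    # For each s, bisect_left returns how many r values are strictly less than s.
    matching_pairs = sum(bisect_left(sorted_r, s) for s in s_values)
    return matching_pairs / (len(r_values) * len(s_values))


if __name__ == "__main__":
    # Hypothetical join columns R.a and S.b.
    print(inequality_join_selectivity([1, 2, 3], [2, 3, 4]))
```

Multiplying the selectivity by |R| * |S| gives the join's result size, which is the number a cost-based optimizer needs when comparing plans.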
Similar resources
Selectivity Estimation for Spatial Joins
Spatial Joins are important and time consuming operations in spatial database management systems. It is crucial to be able to accurately estimate the performance of these operations so that one can derive efficient query execution plans, and even develop/refine data structures to improve their performance. While estimation techniques for analyzing the performance of other operations, such as ra...
Selectivity Estimation for Joins Using Systematic Sampling
We propose a new approach to the estimation of join selectivity. The technique, which we have called “systematic sampling”, is a novel variant of the sampling-based approach. Systematic sampling works as follows: Given a relation R of N tuples, with a join attribute that can be accessed in ascending/descending order via an index, if n is the number of tuples to be sampled from R, select a tuple...
Selectivity Estimation for Spatial Joins with Geometric Selections
Spatial join is an expensive operation that is commonly used in spatial database systems. In order to generate efficient query plans for the queries involving spatial join operations, it is crucial to obtain accurate selectivity estimates for these operations. In this paper we introduce a framework for estimating the selectivity of spatial joins constrained by geometric selections. The center p...
Selectivity Estimation in Extensible Databases - A Neural Network Approach
Extensible database systems allow users to create new data types representing spatial, text, image, and other multimedia objects, and functions which operate on the new types [S96]. The new data types and functions can be used as predicates in SQL queries. Consider an insurance industry application, built using an extensible DBMS that has user-defined extensions for spatial, text, date/time, and...
Lightning Fast and Space Efficient Inequality Joins
Inequality joins, which join relational tables on inequality conditions, are used in various applications. While there has been a wide range of optimization methods for joins in database systems, from algorithms such as sort-merge join and band join, to various indices such as B-tree, R*-tree and Bitmap, inequality joins have received little attention and queries containing such joins are usua...
Journal
Journal title: Mathematics
Year: 2023
ISSN: 2227-7390
DOI: https://doi.org/10.3390/math11061383